Method and device for voice broadcast

ABSTRACT

A voice broadcast method includes: acquiring at least one piece of voice information; delivering the at least one piece of voice information to a voice broadcast device, wherein, the at least one piece of voice information includes voice identification information and a voice content; determining a first voice content to be broadcast by the voice broadcast device, wherein the first voice content is included in the at least one piece of voice information stored at the voice broadcast device; and sending, to the voice broadcast device, a first voice identification information corresponding to the first voice content, wherein the first voice identification information is used by the voice broadcast device to obtain the first voice content stored at the voice broadcast device, and to broadcast the first voice content when a preset voice broadcast condition is met.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation application of International Patent Application No. PCT/CN2020/071482, filed on Jan. 10, 2020, which claims priority to and benefits of the Chinese Patent Application No. 201910298310.9, filed on Apr. 15, 2019. The contents of the above-referenced applications are incorporated herein by reference in their entirety.

TECHNICAL FIELD

This specification relates to the field of the Internet and voice processing technologies, and in particular, to a voice broadcast method and apparatus.

BACKGROUND

In network transactions, marketing activities of a transaction platform and the rights and interests of merchants are accurately delivered to the corresponding merchants by voice broadcast through long-tail Internet of Things (IoT) devices, to increase the number of payments collected by merchants.

SUMMARY OF THE INVENTION

One or more embodiments of this specification aim to provide a voice broadcast method and device, to improve the timeliness and accuracy of the voice broadcast.

One or more embodiments of this specification provide a voice broadcast method, including: acquiring at least one piece of voice information; delivering the at least one piece of voice information to a voice broadcast device, wherein, the at least one piece of voice information includes voice identification information and a voice content; determining a first voice content to be broadcast by the voice broadcast device, wherein the first voice content is included in the at least one piece of voice information stored at the voice broadcast device; and sending, to the voice broadcast device, a first voice identification information corresponding to the first voice content, wherein the first voice identification information is used by the voice broadcast device to obtain the first voice content stored at the voice broadcast device, and to broadcast the first voice content when a preset voice broadcast condition is met.

In one embodiment, the determining a first voice content to be broadcast by the voice broadcast device comprises: determining, according to a preset voice broadcast rule, whether the voice broadcast device needs to broadcast a voice content; and determining, in response to determining that the voice broadcast device needs to broadcast a voice content, that the first voice content is to be broadcast by the voice broadcast device.

In one embodiment, the determining, according to a preset voice broadcast rule, whether the voice broadcast device needs to broadcast a voice content comprises: determining, that the voice broadcast device needs to broadcast the first voice content, in response to: a total number of broadcasts in a first designated time segment is less than a first preset threshold, a first duration from the last broadcast of a voice content reaches a preset duration, or a number of consecutive broadcasts of voice contents in a second designated time segment reaches a second preset threshold.

In one embodiment, the determining a first voice content to be broadcast by the voice broadcast device comprises one of the following: determining the first voice content as a designated voice content according to a preset carousel broadcasting rule, wherein the carousel broadcasting rule is configured to specify a broadcasting order of voice contents in the at least one piece of voice information; and determining the first voice content according to personal information or preference information of a user associated with the voice broadcast device.

In one embodiment, the acquiring at least one piece of voice information comprises: acquiring related information of the voice broadcast device, wherein the related information includes at least one of the following: user information of a user associated with the voice broadcast device, behavior information of the user, preference information of the user, and location information of the voice broadcast device; and acquiring, according to the related information, the at least one piece of voice information corresponding to the voice broadcast device.

In one embodiment, the delivering the at least one piece of voice information to a voice broadcast device comprises: determining, if there is a task of delivering voice information to other voice broadcast devices, priorities respectively corresponding to the voice broadcast device and other voice broadcast devices; and determining, according to the priorities respectively corresponding to the voice broadcast device and the other voice broadcast devices, a delivery time of the at least one piece of voice information, and delivering, according to the delivery time, the at least one piece of voice information to the voice broadcast device.

In one embodiment, the method further includes: obtaining, when an update is performed on the at least one piece of voice information, the updated at least one piece of voice information, wherein the update includes at least one of adding, deleting, or modifying the at least one piece of voice information; and delivering the updated at least one piece of voice information to the voice broadcast device.

In one embodiment, the first voice content includes a transaction voice content, and the determining a first voice content to be broadcast by the voice broadcast device includes: determining, according to a number of transactions or a transaction amount of a user associated with the voice broadcast device, the transaction voice content to be broadcast by the voice broadcast device.

In one embodiment, the determining, according to a number of transactions or a transaction amount of the user, the transaction voice content to be broadcast by the voice broadcast device includes: acquiring a preset mapping relationship between a range of the number of transactions or a range of transaction amounts and the voice content; determining a range of a first number of transactions, wherein the number of transactions falls within the range of the first number of transactions, or determining a first transaction amount range, wherein the transaction amount falls within the first transaction amount range; and determining, according to the range of the first number of transactions, the first transaction amount range, and the mapping relationship, the transaction voice content to be broadcast by the voice broadcast device.

One or more embodiments of this specification provide a voice broadcast device, including: one or more processors; a memory storing instructions executable by the one or more processors to cause the one or more processors to perform operations comprising: acquiring at least one piece of voice information; delivering the at least one piece of voice information to a voice broadcast device, wherein, the at least one piece of voice information includes voice identification information and a voice content; determining a first voice content to be broadcast by the voice broadcast device, wherein the first voice content is included in the at least one piece of voice information stored at the voice broadcast device; and sending, to the voice broadcast device, a first voice identification information corresponding to the first voice content, wherein the first voice identification information is used by the voice broadcast device to obtain the first voice content stored at the voice broadcast device, and to broadcast the first voice content when a preset voice broadcast condition is met.

One or more embodiments of this specification provide one or more non-transitory computer-readable storage media configured with instructions executable by one or more processors to cause the one or more processors to perform operations, including: acquiring at least one piece of voice information; delivering the at least one piece of voice information to a voice broadcast device, wherein, the at least one piece of voice information includes voice identification information and a voice content; determining a first voice content to be broadcast by the voice broadcast device, wherein the first voice content is included in the at least one piece of voice information stored at the voice broadcast device; and sending, to the voice broadcast device, a first voice identification information corresponding to the first voice content, wherein the first voice identification information is used by the voice broadcast device to obtain the first voice content stored at the voice broadcast device, and to broadcast the first voice content when a preset voice broadcast condition is met.

In a technical solution adopting one or more embodiments of this specification, voice information is pre-delivered to a target voice broadcast device, the first voice content to be broadcast by the target voice broadcast device is then determined, and the first voice identification information corresponding to the first voice content is sent to the target voice broadcast device, so that the target voice broadcast device is capable of obtaining the first voice content locally according to the first voice identification information. It can be seen that the technical solution makes it unnecessary for the target voice broadcast device to download from the server when voice needs to be broadcast, which solves the delay problem that occurs when the voice information is downloaded in real time. Moreover, because the voice information is delivered in advance by the server and stored locally on the device, even if the voice information is changed, there is no need to make any change to the device hardware, thereby solving the problem that an update of voice information requires a hardware upgrade such that the information cannot be broadcast in time. This improves the timeliness and accuracy of the voice broadcast.

BRIEF DESCRIPTION OF DRAWINGS

To illustrate the technical solutions in one or more embodiments of this specification or in the prior art more clearly, the following briefly introduces the accompanying drawings required for describing the embodiments or the prior art. Apparently, the accompanying drawings in the following description show merely some embodiments recorded in one or more embodiments of this specification, and a person of ordinary skill in the art may still derive other drawings from these accompanying drawings without creative efforts.

FIG. 1 is a flowchart of a voice broadcast method according to one embodiment of this specification.

FIG. 2 is a flowchart of a voice broadcast method according to one embodiment of this specification.

FIG. 3 is a block diagram of a voice broadcast device according to one embodiment of this specification.

FIG. 4 is a block diagram of a voice broadcast device according to one embodiment of this specification.

FIG. 5 is a schematic block diagram of a voice broadcast device according to one embodiment of this specification.

FIG. 6 is a schematic block diagram of a voice broadcast device according to one embodiment of this specification.

DETAILED DESCRIPTION OF EMBODIMENTS

One or more embodiments of this specification provide a voice broadcast method and device, to improve the timeliness and accuracy of the voice broadcast.

Generally, all voice files are stored in IoT devices in advance, and voice broadcast is triggered by the corresponding scenario. The disadvantage of this method is that it is not flexible: if the voice content changes, the device needs to be upgraded before it can continue to broadcast. Another method is to download voice files every time the IoT device needs to broadcast. However, because the hardware of the IoT device is weak and downloading incurs a certain delay, the first piece or even the first few pieces of voice cannot be broadcast.

In order to enable those skilled in the art to better understand the technical solutions in one or more embodiments of this specification, the following clearly and completely describes the technical solutions in one or more embodiments of this specification with reference to the accompanying drawings in one or more embodiments of this specification. Apparently, the described embodiments are only some rather than all of the embodiments of this specification. All other embodiments obtained by those of ordinary skill in the art based on one or more embodiments of this specification without creative work shall fall within the protection scope of one or more embodiments of this specification.

FIG. 1 is a flowchart of a voice broadcast method according to an embodiment of this specification, including the following steps.

At S102, the method may include acquiring at least one piece of voice information and delivering the at least one piece of voice information to a target voice broadcast device. The voice information includes voice identification information and a voice content.

At S104, the method may include determining the first voice content to be broadcast by the target voice broadcast device.

At S106, the method may include sending a first voice identification information corresponding to the first voice content to the target voice broadcast device.

The first voice identification information is used by the target voice broadcast device to obtain the first voice content and broadcast the first voice content when a preset voice broadcast condition is met. The preset voice broadcast condition may relate to the broadcasting of other voice content, that is, voice content that is not in the at least one piece of voice information delivered by the server. For example, the target voice broadcast device currently needs to broadcast the first voice content. If the target voice broadcast device is broadcasting other voice content at that time, the first voice content may be broadcast after the broadcast of the other voice content ends.

In some embodiments, the target voice broadcast device stores in advance at least one piece of voice information delivered by the server, including voice identification information and the corresponding voice content. Therefore, after receiving the first voice identification information delivered by the server, the target voice broadcast device can locally obtain the first voice content corresponding to the first voice identification information for broadcasting, thereby avoiding the delay caused by downloading the voice content from the server. This is especially beneficial for IoT devices with weak hardware, for which download delays are more pronounced.
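The local lookup described above can be illustrated with a minimal sketch. The class name `BroadcastDevice`, its methods, and the identifier `"v-001"` are hypothetical and are used only for illustration; this specification does not prescribe any particular data structure or audio format.

```python
from typing import Dict, Optional


class BroadcastDevice:
    """Hypothetical device-side store of pre-delivered voice information."""

    def __init__(self) -> None:
        # Maps voice identification information to voice content.
        self._store: Dict[str, bytes] = {}

    def receive_voice_information(self, voice_id: str, content: bytes) -> None:
        # Called when the server pre-delivers a piece of voice information.
        self._store[voice_id] = content

    def resolve(self, voice_id: str) -> Optional[bytes]:
        # Look the content up locally; nothing is downloaded at broadcast time.
        return self._store.get(voice_id)


device = BroadcastDevice()
device.receive_voice_information("v-001", b"placeholder audio bytes")
first_voice_content = device.resolve("v-001")
```

In this sketch the server only needs to send the short identifier at broadcast time, while the larger voice content has already been stored on the device.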

In some embodiments, the voice information is pre-delivered to the target voice broadcast device, the first voice content to be broadcast by the target voice broadcast device is then determined, and the first voice identification information corresponding to the first voice content is sent to the target voice broadcast device, so that the target voice broadcast device is capable of obtaining the first voice content locally according to the first voice identification information. In this situation, it is unnecessary for the target voice broadcast device to download from the server when voice needs to be broadcast, which solves the delay problem that occurs when the voice information is downloaded in real time. Moreover, because the voice information is delivered in advance by the server and stored locally on the device, even if the voice information is changed, there is no need to make any change to the device hardware, thereby solving the problem that an update of voice information requires a hardware upgrade such that the information cannot be broadcast in time. This improves the timeliness and accuracy of the voice broadcast.

In one embodiment, when the at least one piece of voice information is updated, the updated at least one piece of voice information is acquired, and the updated at least one piece of voice information is delivered to the target voice broadcast device. The update to the voice information includes at least one of adding, deleting, or modifying the voice information.

In one embodiment, by detecting in real time whether the voice information is updated and sending the updated voice information to the target voice broadcast device when the voice information is updated, the hardware of the target voice broadcast device does not need to be upgraded even if the server-side voice information is changed, thereby solving the problem that an update of voice information requires a hardware upgrade such that the information cannot be broadcast in time. This improves the timeliness and accuracy of the voice broadcast.
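As a rough illustration of the update handling described above, the following sketch applies add, delete, and modify actions to a locally stored mapping from voice identification information to voice content. The tuple layout `(action, voice_id, content)` and the function name `apply_updates` are assumptions made for this example only.

```python
from typing import Dict, List, Tuple

# Hypothetical update record: (action, voice_id, content).
Update = Tuple[str, str, str]


def apply_updates(store: Dict[str, str], updates: List[Update]) -> Dict[str, str]:
    """Return a copy of the local store with the given updates applied."""
    result = dict(store)
    for action, voice_id, content in updates:
        if action in ("add", "modify"):
            result[voice_id] = content
        elif action == "delete":
            # Deleting an identifier that is not stored is treated as a no-op.
            result.pop(voice_id, None)
        else:
            raise ValueError(f"unknown update action: {action}")
    return result


local_store = {"v-001": "old promo", "v-002": "greeting"}
updated_store = apply_updates(
    local_store,
    [("modify", "v-001", "new promo"), ("delete", "v-002", ""), ("add", "v-003", "notice")],
)
```

Because only stored data changes, no firmware or hardware change on the device is involved in this sketch.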

In one embodiment, related information of the target voice broadcast device may be obtained, and at least one piece of voice information corresponding to the target voice broadcast device is then determined according to the related information. The determined at least one piece of voice information is the voice information that is obtained in S102 and delivered to the target voice broadcast device. The related information of the target voice broadcast device may include user information of the device user, behavior information of the device user, preference information of the device user, and device location information.

In some embodiments, the user information of the device user may include a user's personal information, such as age, gender, industry category, industry identity, and the like. According to the user information of a device user, the voice information corresponding to the target voice broadcast device may be determined.

For example, if the age of the device user is between 20 and 30 years, and it is assumed that users in this age range prefer food, the voice information related to food may be determined to be voice information corresponding to the target voice broadcast device. As another example, if the industry identity of the device user is a store owner, and the target voice broadcast device is used to broadcast daily payment collection information (including amounts and number of payments), the voice information corresponding to the target voice broadcast device may be determined according to the payment information of the device user. For example, different amounts and/or different numbers of payments correspond to different voice messages.

The behavior information of the device user may include feedback information of the device user on historically broadcast voice contents and may also include behavior information of the device user corresponding to the voice content broadcast by the device.

According to the feedback information of the device user on the historically broadcast voice content, the voice information corresponding to the target voice broadcast device may be determined. The feedback from the device user on the voice content may include a user's behavior on an event related to the voice content. For example, if, among the historically broadcast voice contents, the device user gives more feedback on the voice content related to “free cash withdrawal” (for example, the user has participated in free cash withdrawal activities many times), then it may be determined that the voice information related to “free cash withdrawal” is the voice information corresponding to the target voice broadcast device.

The behavior information of the device user may also be reflected in the voice content broadcast by the target voice broadcast device. For example, assuming that the device user is a product seller and the target voice broadcast device is a transaction voice broadcast device, the voice content broadcast by the transaction voice broadcast device includes the number of transactions and the transaction amount of the product seller, that is, the behavior information of the product seller. In the voice information stored on the server, different numbers of transactions or different transaction amounts correspond to different voice information.

In some embodiments, the at least one piece of voice information delivered to the target voice broadcast device is determined according to the user information and/or the behavior information of the device user. This information is not fixed; the behavior information of the device user in particular may change within a short period of time. The server can therefore determine the at least one piece of voice information delivered to the target voice broadcast device according to the changing information, which reflects the real-time nature and diversity of the voice information delivered by the server.

In one embodiment, the related information may further include device location information, device user preference information, and the like. If the device-related information includes device location information, the voice information corresponding to the target voice broadcast device may be determined according to the device location information. For example, the server side divides cloud storage space based on geographic location information, that is, each geographic location corresponds to its own cloud storage space, and each cloud storage space stores the same or different voice information. If the device location information is Beijing, the voice information in the cloud storage space corresponding to Beijing is first selected, and that voice information may be directly determined as the voice information corresponding to the target voice broadcast device. Alternatively, according to other factors (such as user information, user behavior information, etc.), the voice information corresponding to the target voice broadcast device may be further filtered from the voice information in the cloud storage space corresponding to Beijing.
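The location-based selection described above might be sketched as follows. The `VoiceInfo` record, its `tags` field, and the tag-matching filter are hypothetical; this specification does not define how voice information is labeled or how the further filtering is performed.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List, Optional, Set


@dataclass(frozen=True)
class VoiceInfo:
    voice_id: str
    tags: FrozenSet[str]  # hypothetical topic labels attached to the voice content


def candidates_for_device(
    storage_by_location: Dict[str, List[VoiceInfo]],
    location: str,
    user_interests: Optional[Set[str]] = None,
) -> List[VoiceInfo]:
    """Select voice information for a device by location, then optionally by interest."""
    regional = storage_by_location.get(location, [])
    if not user_interests:
        # No further factors: use the regional voice information directly.
        return list(regional)
    # Further filter by other factors, modeled here as overlapping tags.
    return [v for v in regional if user_interests & v.tags]


storage = {
    "Beijing": [
        VoiceInfo("v-001", frozenset({"food"})),
        VoiceInfo("v-002", frozenset({"free cash withdrawal"})),
    ],
    "Shanghai": [VoiceInfo("v-101", frozenset({"food"}))],
}
all_beijing = candidates_for_device(storage, "Beijing")
food_only = candidates_for_device(storage, "Beijing", {"food"})
```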

If the device-related information includes preference information of the device user, the voice information corresponding to the target voice broadcast device may be determined according to the preference information of the device user. The server may determine the preference information of the device user according to a specified operation of the device user on the voice information and may also determine the preference information of the device user according to the user information and/or behavior information of the device user. The behavior information of the device user may include feedback information of the device user on the historically broadcast voice content. The feedback from the device user on the voice content may include a user behavior on an event involved in the voice content. For example, if, among the historically broadcast voice contents, the device user gives more feedback on voice contents related to “free cash withdrawal” (for example, the user has participated in free cash withdrawal activities many times), it may be determined that the device user prefers voice information related to “free cash withdrawal”.

In one embodiment, after the at least one piece of voice information is delivered to the target voice broadcast device, whether the target voice broadcast device needs to broadcast voice content in the delivered voice information may be determined according to a preset voice broadcast rule. When it is determined that the target voice broadcast device needs to broadcast voice content in the delivered voice information, the first voice content to be broadcast by the target voice broadcast device is determined. The following rule (that is, the preset voice broadcast rule) may be used to determine whether the target voice broadcast device needs to broadcast the voice content: when the broadcast information corresponding to the target voice broadcast device meets a preset broadcast condition, it is determined that the target voice broadcast device needs to broadcast the voice content.

The broadcast information may include, but is not limited to, any one or more of the following: (1) the total number of broadcasts in the first designated time segment, where the total number of broadcasts refers to the sum of the numbers of times that the target voice broadcast device broadcasts at least the following voice contents: the voice content in the at least one piece of voice information delivered by the server, and the local voice content pre-stored in the target voice broadcast device; (2) the first duration since the last broadcast of voice content in the at least one piece of delivered voice information; and (3) the number of consecutive broadcasts of other voice content within the second designated time segment, where other voice content refers to voice content other than the voice content in the at least one piece of voice information delivered by the server, for example, voice content locally pre-stored in the target voice broadcast device.

Based on the foregoing broadcast information, the preset broadcast conditions may include at least one of the following: the total number of broadcasts in the first designated time segment is less than the first preset threshold; the first duration since the last broadcast of voice content in the at least one piece of delivered voice information reaches the preset duration; the number of consecutive broadcasts of other voice content within the second designated time segment reaches the second preset threshold; and the like. For example, the first designated time segment is the current whole day. If the total number of broadcasts of the target voice broadcast device on that day is less than 50 (that is, the first preset threshold), and the duration since the last broadcast of voice content in the at least one piece of delivered voice information reaches 1 hour (that is, the preset duration), it is determined that the target voice broadcast device needs to broadcast voice content in the at least one piece of delivered voice information. As another example, the second designated time segment is the last 5 hours. If the target voice broadcast device has consecutively broadcast other voice content 10 times (that is, the second preset threshold) in the last 5 hours, but has not broadcast any voice content in the at least one piece of voice information delivered by the server, it is determined that the target voice broadcast device needs to broadcast voice content in the at least one piece of delivered voice information. After it is determined that the target voice broadcast device needs to broadcast voice content in the at least one piece of delivered voice information, the first voice content that the target voice broadcast device needs to broadcast may be determined.
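The two worked examples above can be expressed as a small decision function. This is only a sketch: the field names, the default thresholds (50 broadcasts, 1 hour, 10 consecutive broadcasts), and the way the conditions are combined mirror the examples in the preceding paragraph and are assumptions, since the conditions may be used individually or in other combinations.

```python
from dataclasses import dataclass


@dataclass
class BroadcastInfo:
    total_broadcasts_today: int            # first designated time segment: current day
    hours_since_last_delivered: float      # time since delivered content was last broadcast
    consecutive_other_broadcasts: int      # within the second designated time segment


def needs_broadcast(
    info: BroadcastInfo,
    first_threshold: int = 50,
    preset_duration_hours: float = 1.0,
    second_threshold: int = 10,
) -> bool:
    """Decide whether the device should broadcast delivered voice content."""
    # First example: under the daily cap and enough time has passed.
    under_cap_and_due = (
        info.total_broadcasts_today < first_threshold
        and info.hours_since_last_delivered >= preset_duration_hours
    )
    # Second example: too many consecutive broadcasts of other (local) content.
    too_many_others = info.consecutive_other_broadcasts >= second_threshold
    return under_cap_and_due or too_many_others


due = needs_broadcast(BroadcastInfo(20, 1.5, 0))
not_due = needs_broadcast(BroadcastInfo(60, 0.2, 3))
forced = needs_broadcast(BroadcastInfo(60, 0.2, 10))
```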

In one embodiment, the first voice content that the target voice broadcast device needs to broadcast may be determined in any of the following ways: (1) determining the first voice content according to a preset carousel broadcasting rule, where the carousel broadcasting rule is used to specify a carousel broadcasting order of the voice contents in the at least one piece of voice information. For example, if 20 voice contents are delivered to the target voice broadcast device, then carousel broadcasting may be carried out for the 20 voice contents. The server may first determine the voice content that was broadcast by the target voice broadcast device last time, and then determine the first voice content to be broadcast this time according to the carousel broadcasting order of the 20 voice contents. For example, the carousel broadcasting rule specifies that the carousel broadcasting order of the voice contents in the at least one piece of voice information is: voice content A, voice content B, voice content C, etc. If the last broadcast voice content is voice content B, then the first voice content to be broadcast this time is voice content C.

(2) determining that a designated voice content is the first voice content. The device user may specify in advance the first voice content to be broadcast by the target voice broadcast device. The device user may pre-designate only the first voice content to be broadcast by the target voice broadcast device this time or may pre-designate the voice content to be broadcast by the target voice broadcast device each time within a certain time segment. For example, the device user specifies that the target voice broadcast device broadcasts voice content A when it needs to broadcast voice content for the first time today, broadcasts voice content B when it needs to broadcast voice content for the second time today, and so on. The server can determine the first voice content that the target voice broadcast device needs to broadcast this time according to the information specified in advance by the device user.

(3) determining the first voice content according to personal information and/or preference information of a device user. The personal information of a device user may include age, gender, industry category, industry identity, and so on. For example, if the industry identity of the device user is a marketer, the voice content that is conducive to promoting marketing may be determined as the first voice content to be broadcast by the target voice broadcast device this time, such as “10 payment collections today will give you free withdrawal benefits”.

The preference information of the device user may be determined according to a specified operation of the device user on the voice information, and may also be determined according to the personal information and/or behavior information of the device user. The behavior information of the device user may include feedback information of the device user on the historically broadcast voice content. The feedback from the device user on the voice content may include a user behavior on an event involved in the voice content. For example, if, among the historically broadcast voice contents, the device user gives more feedback on voice contents related to “free cash withdrawal” (for example, the user has participated in free cash withdrawal activities many times), it may be determined that the device user prefers voice information related to “free cash withdrawal”. Then, the voice content related to “free cash withdrawal” may be selected from the at least one piece of voice information as the first voice content to be broadcast by the target voice broadcast device.
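Way (1) above, the rotation through voice contents A, B, C, can be sketched as follows. The function name `next_in_carousel`, the wrap-around to the start of the order after the last item, and the fallback to the first item when nothing has been broadcast yet are assumptions for illustration; this specification only states that the next content follows the last broadcast content in the specified order.

```python
from typing import Optional, Sequence


def next_in_carousel(order: Sequence[str], last_broadcast: Optional[str]) -> str:
    """Return the voice content that follows the last broadcast one in the rotation."""
    if not order:
        raise ValueError("the broadcasting order must not be empty")
    if last_broadcast is None or last_broadcast not in order:
        # Assumption: start from the beginning when there is no usable history.
        return order[0]
    index = order.index(last_broadcast)
    # Assumption: wrap around to the first item after the last one.
    return order[(index + 1) % len(order)]


order = ["voice content A", "voice content B", "voice content C"]
after_b = next_in_carousel(order, "voice content B")
after_c = next_in_carousel(order, "voice content C")
```

Here `after_b` is voice content C, matching the example in way (1).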

In one embodiment, when it is determined that the target voice broadcast device needs to broadcast the first voice content in the delivered voice information, the first voice identification information corresponding to the first voice content is sent to the target voice broadcast device, wherein the first voice identification information is used by the target voice broadcast device to obtain the first voice content, and to broadcast the first voice content when a preset voice broadcast condition is met.

In one embodiment, when at least one piece of voice information is delivered to the target voice broadcast device, if there is currently a task of delivering voice information to other voice broadcast devices, the priorities corresponding to the target voice broadcast device and other voice broadcast devices are determined respectively. Then, according to the respective priorities of the target voice broadcast device and other voice broadcast devices, the delivery time of at least one piece of voice information is determined, and at least one piece of voice information is delivered to the target voice broadcast device according to the delivery time. The delivery time includes the time before the voice information is delivered to other voice broadcast devices, or the time after the voice information is delivered to other voice broadcast devices. Specifically, if the priority corresponding to the target voice broadcast device is higher, at least one piece of voice information is first delivered to the target voice broadcast device, and then the voice information is delivered to other voice broadcast devices; if the priority corresponding to the target voice broadcast device is lower, the voice information is first delivered to other voice broadcast devices, and then at least one piece of voice information is delivered to the target voice broadcast device.

In this embodiment, the priority corresponding to each voice broadcast device may be preset by a user. For example, the numbers 0 to 10 represent the priority levels corresponding to the voice broadcast devices; the larger the number, the higher the priority. When voice information is to be delivered to the target voice broadcast device A and there is a task of delivering voice information to voice broadcast device B, the priority of the target voice broadcast device A is compared with that of the voice broadcast device B. If the user has preset the priority corresponding to the target voice broadcast device A to 10 and the priority corresponding to voice broadcast device B to 9, the target voice broadcast device A has the higher priority, so the corresponding voice information is first delivered to the target voice broadcast device A and then to the voice broadcast device B.
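The priority comparison above can be sketched as a simple ordering of pending delivery tasks. This is a minimal illustration, not the specification's implementation: the `DeliveryTask` type, the `order_deliveries` function, and the tie-breaking rule (equal priorities keep their submission order) are assumptions introduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DeliveryTask:
    device_id: str  # hypothetical identifier of a voice broadcast device
    priority: int   # 0 to 10; a larger number means a higher priority


def order_deliveries(tasks):
    """Return the tasks in execution order, highest priority first.

    sorted() is stable, so tasks with equal priority keep their
    submission order (an assumption; the text does not specify ties).
    """
    return sorted(tasks, key=lambda task: task.priority, reverse=True)


# Device B's task was queued first, but device A has the higher priority.
pending = [DeliveryTask("B", 9), DeliveryTask("A", 10)]
execution_order = [task.device_id for task in order_deliveries(pending)]
```

With the priorities from the example (A is 10, B is 9), the voice information for device A is delivered before that for device B.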

In some embodiments, when voice information is to be delivered to multiple voice broadcast devices, the execution order of the delivery tasks may be determined according to the respective priorities of the voice broadcast devices, thereby avoiding the pressure on the server caused by executing multiple tasks simultaneously. The voice broadcast device with the higher priority can receive the voice information sooner and can therefore broadcast the voice content sooner, meeting the requirements of device users.

In some embodiments, the target voice broadcast device is a transaction voice broadcast device, which is used as an example in this specification to describe the voice broadcast method. FIG. 2 is a flowchart of a voice broadcast method according to an embodiment of this specification, including the following steps.

At S202, the method may include acquiring at least one piece of voice information and delivering the at least one piece of voice information to a transaction voice broadcast device. The voice information includes voice identification information and voice content.

At S204, the method may include determining a transaction voice content to be broadcast by the transaction voice broadcast device.

At S206, the method may include sending the transaction voice identification information corresponding to the transaction voice content to the transaction voice broadcast device.

The transaction voice identification information is used by the transaction voice broadcast device to obtain the transaction voice content and to broadcast the transaction voice content when a preset voice broadcast condition is met.
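The device-side half of steps S202 to S206 can be sketched as follows. This is only an illustration under stated assumptions: the class name, method names, the identifier `"v1"`, and the use of an in-memory dictionary as local storage are all hypothetical and do not come from the specification.

```python
class TransactionVoiceBroadcastDevice:
    """Hypothetical device that stores delivered voice information locally."""

    def __init__(self):
        self.local_store = {}   # voice identification information -> voice content
        self.pending_id = None  # identifier most recently sent by the server

    def receive_voice_information(self, voice_information):
        # S202 (device side): store the pre-delivered voice information.
        self.local_store.update(voice_information)

    def receive_identification(self, voice_id):
        # S206 (device side): only the identifier is sent, not the content.
        self.pending_id = voice_id

    def broadcast_if_ready(self, condition_met):
        # Look the content up locally once the preset condition is met.
        if not condition_met or self.pending_id is None:
            return None
        content = self.local_store.get(self.pending_id)
        self.pending_id = None
        return content


device = TransactionVoiceBroadcastDevice()
device.receive_voice_information(
    {"v1": "10 payment collections today will give you free withdrawal benefits"}
)
device.receive_identification("v1")
before = device.broadcast_if_ready(condition_met=False)
after = device.broadcast_if_ready(condition_met=True)
```

Because the content is already stored on the device, the server only needs to send the short identifier at broadcast time.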

In one embodiment, the transaction information of the device user of the transaction voice broadcast device may be obtained, wherein the transaction information may include information such as the number of transactions, the transaction amount, and the like. Further, at least one piece of voice information corresponding to the transaction voice broadcast device is determined according to the transaction information of the device user, wherein the at least one piece of voice information includes voice identification information and voice content, and then the at least one piece of voice information corresponding to the transaction voice broadcast device is delivered to the transaction voice broadcast device. The server can generate statistics of the transaction information of device users in real time. Transaction information that falls into different ranges corresponds to different voice information. For example, the number of transactions per day ranging from 50 to 100 corresponds to voice information set A, the number of transactions per day ranging from 100 to 200 corresponds to voice information set B, and so on.
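The range-to-set correspondence in this example can be sketched as a small lookup table. The ranges as stated share the boundary value 100, so the sketch assumes half-open ranges (lower bound inclusive, upper bound exclusive); that choice, the table name, and the function name are assumptions made for illustration.

```python
# Hypothetical table: (lower bound inclusive, upper bound exclusive, set name).
VOICE_SETS_BY_DAILY_TRANSACTIONS = [
    (50, 100, "A"),
    (100, 200, "B"),
]


def select_voice_set(daily_transactions):
    """Return the voice information set for a daily transaction count.

    Returns None when the count falls outside every configured range.
    """
    for low, high, name in VOICE_SETS_BY_DAILY_TRANSACTIONS:
        if low <= daily_transactions < high:
            return name
    return None
```

Under this assumption, a user with exactly 100 transactions per day is assigned set B rather than set A.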

In one embodiment, after the at least one piece of voice information is delivered to the transaction voice broadcast device, it may be determined, according to the preset voice broadcast rule, whether the transaction voice broadcast device needs to broadcast the voice content in the at least one piece of voice information. When it is determined that the transaction voice broadcast device needs to broadcast the voice content, the transaction voice content to be broadcast by the transaction voice broadcast device is determined. For example, firstly, the broadcast information corresponding to the transaction voice broadcast device is counted; secondly, whether the broadcast information meets a preset broadcast condition is determined. If the broadcast information meets the preset broadcast condition, it is determined that the transaction voice broadcast device needs to broadcast the transaction voice content in the delivered at least one piece of voice information.

The broadcast information corresponding to the transaction voice broadcast device may include, but is not limited to, any one or more of the following: (1) the total number of broadcasts in the first designated time segment, wherein the total number of broadcasts refers to the sum of the number of times that the transaction voice broadcast device broadcasts at least the following voice contents: the voice content in the at least one piece of voice information delivered by the server, and the local voice content pre-stored in the transaction voice broadcast device; (2) the first duration since the last broadcast of the transaction voice content in the at least one piece of voice information delivered by the server; and (3) the number of consecutive broadcasts of other voice contents within the second designated time segment, wherein other voice content refers to voice content other than the transaction voice content in the at least one piece of voice information delivered by the server, for example, voice content locally pre-stored in the transaction voice broadcast device.

Based on the foregoing broadcast information, the preset broadcast conditions may include at least one of the following: the total number of broadcasts in the first designated time segment is less than the first preset threshold, the first duration since the last broadcast of the transaction voice content reaches the preset duration, the number of consecutive broadcasts of other voice content within the second designated time segment reaches a second preset threshold, and the like. For example, the first designated time segment is the current whole day. If the total number of broadcasts by the transaction voice broadcast device on the day is less than 50 (that is, the first preset threshold), and the duration since the last broadcast of the transaction voice content reaches 1 hour (that is, the preset duration), it is determined that the transaction voice broadcast device needs to broadcast the transaction voice content.

As another example, the second designated time segment is the last 5 hours. If the transaction voice broadcast device has consecutively broadcast other voice content 10 times (that is, the second preset threshold) in the last 5 hours, but the transaction voice content has not been broadcast, it is determined that the transaction voice broadcast device needs to broadcast the transaction voice content.
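The two examples above can be sketched as a single check over the counted broadcast information. The threshold values (50 broadcasts, 1 hour, 10 consecutive broadcasts) come from the examples; the `BroadcastStats` type, the field names, and the decision to treat the two examples as alternative rules joined by "or" are assumptions made for illustration.

```python
from dataclasses import dataclass

FIRST_PRESET_THRESHOLD = 50    # total broadcasts in the current day
PRESET_DURATION_HOURS = 1.0    # time since the last transaction voice content
SECOND_PRESET_THRESHOLD = 10   # consecutive other broadcasts in the last 5 hours


@dataclass
class BroadcastStats:
    """Hypothetical container for the counted broadcast information."""
    total_broadcasts_today: int
    hours_since_last_transaction_content: float
    consecutive_other_broadcasts_last_5h: int


def needs_transaction_broadcast(stats):
    # First example: few broadcasts today AND enough time since the last one.
    daily_rule = (
        stats.total_broadcasts_today < FIRST_PRESET_THRESHOLD
        and stats.hours_since_last_transaction_content >= PRESET_DURATION_HOURS
    )
    # Second example: many consecutive broadcasts of other voice content.
    consecutive_rule = (
        stats.consecutive_other_broadcasts_last_5h >= SECOND_PRESET_THRESHOLD
    )
    # Assumption: either rule alone is sufficient.
    return daily_rule or consecutive_rule
```

A deployment could equally require a different combination of the conditions, since the text only states that the preset broadcast conditions include at least one of them.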

In one embodiment, the transaction voice content in the voice information to be broadcast by the transaction voice broadcast device may be determined according to the number of transactions and/or the transaction amount of the device user; and then the transaction voice identification information corresponding to the transaction voice content is sent to the transaction voice broadcast device. The transaction voice identification information is used by the transaction voice broadcast device to obtain the transaction voice content and to broadcast the transaction voice content when the preset voice broadcast condition is met. The preset voice broadcast condition may include: broadcasting other voice content that is not in the at least one piece of voice information delivered by the server. That is, when the transaction voice broadcast device currently needs to broadcast the transaction voice content and is broadcasting other voice content, the transaction voice content may be broadcast after the broadcast of the other voice content ends. For example, the transaction voice broadcast device currently needs to broadcast the transaction voice content “10 payment collections today will give you free withdrawal benefits”. After the transaction voice broadcast device broadcasts other voice content such as “Alipay received RMB XX”, the transaction voice content “10 payment collections today will give you free withdrawal benefits” is broadcast additionally.

When the transaction voice content in the voice information that the transaction voice broadcast device needs to broadcast is determined according to the number of transactions and/or the transaction amount, the following steps may be performed.

In a first step, a preset mapping relationship between a range of the number of transactions and/or a range of transaction amounts and the voice content may be acquired.

In a second step, a range of a first number of transactions and/or a first transaction amount range may be determined, wherein the number of transactions falls within the range of the first number of transactions, and the transaction amount falls within the first transaction amount range.

In a third step, according to the range of the first number of transactions, the first transaction amount range, and the mapping relationship, the transaction voice content in the voice information that the transaction voice broadcast device needs to broadcast may be determined.

For example, in the mapping relationship pre-stored by the server, the voice content corresponding to the number of transactions ranging from 1 to 5 is “5 payment collections today will give you more-pay more-return benefits”, and the voice content corresponding to the number of transactions ranging from 6 to 10 is “10 payment collections today will give you free withdrawal benefits”. If the transaction voice broadcast device is about to broadcast the 5th collection voice content and the server determines that the transaction voice broadcast device currently needs to broadcast the transaction voice content, then the transaction voice identification information of the transaction voice content “5 payment collections today will give you more-pay more-return benefits” is delivered to the transaction voice broadcast device, so that the transaction voice broadcast device can additionally broadcast the transaction voice content “5 payment collections today will give you more-pay more-return benefits” after the broadcast of the 5th collection voice content. If the transaction voice broadcast device is about to broadcast the 10th collection voice content and the server determines that the transaction voice broadcast device currently needs to broadcast the transaction voice content, then the transaction voice identification information of the transaction voice content “10 payment collections today will give you free withdrawal benefits” is delivered to the transaction voice broadcast device, so that the transaction voice broadcast device can additionally broadcast the transaction voice content “10 payment collections today will give you free withdrawal benefits” after the broadcast of the 10th collection voice content.
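The three steps and the example above can be sketched as a range lookup on the server followed by a local lookup on the device. The two ranges and the quoted voice contents are taken from the example; the identifiers `"voice-001"` and `"voice-002"`, the function names, and the placeholder collection message are hypothetical.

```python
# Server side: preset mapping from an inclusive range of the number of
# transactions to voice identification information (identifiers assumed).
RANGE_TO_VOICE_ID = [
    (1, 5, "voice-001"),
    (6, 10, "voice-002"),
]

# Device side: voice information delivered in advance and stored locally.
DEVICE_STORE = {
    "voice-001": "5 payment collections today will give you more-pay more-return benefits",
    "voice-002": "10 payment collections today will give you free withdrawal benefits",
}


def voice_id_for(transaction_count):
    """Find the range the count falls in and return its identifier."""
    for low, high, voice_id in RANGE_TO_VOICE_ID:
        if low <= transaction_count <= high:
            return voice_id
    return None


def broadcast_sequence(collection_message, voice_id):
    """Return what the device plays: the collection message first, then
    the transaction voice content looked up locally by its identifier."""
    sequence = [collection_message]
    if voice_id in DEVICE_STORE:
        sequence.append(DEVICE_STORE[voice_id])
    return sequence


played = broadcast_sequence("collection message for the 5th payment", voice_id_for(5))
```

Only the identifier crosses the network at broadcast time; the device appends the locally stored content after the collection message.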

From the above embodiments, when the technical solution is applied to a transaction voice broadcast scenario, the voice information to be delivered to the transaction voice broadcast device is determined according to the user information, the behavior information, and/or the transaction information of the device user, which realizes a strategy of real-time delivery of the voice information to the transaction voice broadcast device. The techniques require no improvement to the device hardware, thereby solving the problem that updating the voice information requires a hardware upgrade and that the broadcast of the information is consequently delayed. This reduces wait time and improves the accuracy of the voice broadcast. In addition, the technical solution can also determine the transaction voice content in the voice information to be broadcast by the transaction voice broadcast device according to the number of transactions and/or the transaction amount of the device user, so that the transaction voice content broadcast by the transaction voice broadcast device may be more consistent with the current collection scenario of the device user, thereby realizing the flexibility and real-time updateability of the transaction voice content to be broadcast by the transaction voice broadcast device.

In some embodiments, the actions may be performed in a different order and still achieve the desired results. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown or a sequential order to achieve the desired results.

One or more embodiments also provide a voice broadcast device. FIG. 3 is a block diagram of a voice broadcast device according to an embodiment of this specification. The voice broadcast device 300 may include: a first delivering module 310, configured to acquire at least one piece of voice information and deliver the at least one piece of voice information to a target voice broadcast device, wherein the at least one piece of voice information includes voice identification information and voice content; a first determining module 320, configured to determine a first voice content to be broadcast by the target voice broadcast device; and a first sending module 330, configured to send a first voice identification information corresponding to the first voice content to the target voice broadcast device, wherein the first voice identification information is used by the target voice broadcast device to obtain the first voice content, and to broadcast the first voice content when a preset voice broadcast condition is met.

In one embodiment, the first determining module 320 may include: a first determining unit, configured to determine whether the target voice broadcast device needs to broadcast the voice content according to a preset voice broadcast rule; a first determining unit, configured to determine, if the target voice broadcast device needs to broadcast the voice content, a first voice content to be broadcast by the target voice broadcast device. In one embodiment, the determining unit is further configured to: determine, when the broadcast information corresponding to the voice broadcast device meets a preset broadcast condition, that the target voice broadcast device needs to broadcast the voice content. The broadcast information includes at least one of the total number of broadcasts in the first designated time segment, the first duration since the last broadcast of the voice content, and the number of consecutive broadcasts of other voice contents in the second designated time segment.

In one embodiment, the preset broadcast condition includes at least one of the following: the total number of broadcasts is less than a first preset threshold; the first duration reaches a preset duration; the number of consecutive broadcasts of other voice contents in the second designated time segment reaches a second preset threshold.

In one embodiment, the first determining module includes at least one of the following: a second determining unit, configured to determine the first voice content according to a preset carousel broadcasting rule, wherein the carousel broadcasting rule is used to specify a carousel broadcasting order of each voice content in the at least one piece of voice information; a third determining unit, configured to determine that a designated voice content is the first voice content; a fourth determining unit, configured to determine the first voice content according to personal information and/or preference information of a device user.

In some embodiments, the voice information is pre-delivered to the target voice broadcast device, the first voice content to be broadcast by the target voice broadcast device is determined, and the first voice identification information corresponding to the first voice content is sent to the target voice broadcast device, so that the target voice broadcast device is capable of obtaining the first voice content locally according to the first voice identification information. The target voice broadcast device therefore does not need to download the voice from the server at the time of broadcast, which avoids the delay caused by downloading the voice information in real time. Moreover, because the voice information is delivered in advance by the server and stored locally, even if the voice information is changed, there is no need to make any improvement in the device hardware, thereby solving the problem that updating the voice information requires a hardware upgrade such that the information cannot be broadcast in time. This improves the timeliness and accuracy of the voice broadcast.

FIG. 4 is a block diagram of a voice broadcast device according to an embodiment of this specification. The voice broadcast device 400 may include: a second delivering module 410, configured to acquire at least one piece of voice information and deliver the at least one piece of voice information to a transaction voice broadcast device, wherein the at least one piece of voice information includes voice identification information and voice content; a second determining module 420, configured to determine a transaction voice content to be broadcast by the transaction voice broadcast device; and a second sending module 430, configured to send a transaction voice identification information corresponding to the transaction voice content to the transaction voice broadcast device, wherein the transaction voice identification information is used by the transaction voice broadcast device to obtain the transaction voice content, and to broadcast the transaction voice content when a preset voice broadcast condition is met.

In one embodiment, the second determining module 420 includes: a second determining unit, configured to determine whether the transaction voice broadcast device needs to broadcast the voice content according to a preset voice broadcast rule; a fifth determining unit, configured to determine, if the transaction voice broadcast device needs to broadcast the voice content, a transaction voice content to be broadcast by the transaction voice broadcast device.

In one embodiment, the second determining module 420 includes: a sixth determining unit, configured to, according to the number of transactions and/or the transaction amount of the device user, determine the transaction voice content to be broadcast by the transaction voice broadcast device. The sixth determining unit is further configured to: acquire a preset mapping relationship between a range of the number of transactions and/or a range of transaction amounts and the voice content; determine a range of a first number of transactions within which the number of transactions falls and/or determine a first transaction amount range within which the transaction amount falls; determine, according to the range of the first number of transactions, the first transaction amount range, and the mapping relationship, the transaction voice content to be broadcast by the transaction voice broadcast device.

In some embodiments, the transaction voice content in the voice information to be broadcast by the transaction voice broadcast device is determined according to the number of transactions and/or the transaction amount of the device user, so that the transaction voice content broadcast by the transaction voice broadcast device may be more consistent with the current collection scenario of the device user. This realizes the flexibility and real-time updateability of the transaction voice content to be broadcast by the transaction voice broadcast device.

Those skilled in the art should understand that the above-mentioned voice broadcast device is capable of being used to implement the aforementioned voice broadcast method, and the detailed description thereof may be similar to that described in the method above.

One or more embodiments of this specification also provide a voice broadcast device, as shown in FIG. 5. The voice broadcast device may vary considerably depending on its configuration or performance and may include one or more processors 501 and a memory 502. The memory 502 may store one or more applications or data. The memory 502 may be a temporary storage or a persistent storage. The application program stored in the memory 502 may include one or more modules (not shown in the figure), and each module may include a series of computer-executable instructions in a voice broadcast device. Furthermore, the processor 501 may be configured to communicate with the memory 502 and execute a series of computer-executable instructions in the memory 502 on the voice broadcast device. The voice broadcast device may further include one or more power sources 503, one or more wired or wireless network interfaces 504, one or more input/output interfaces 505, and one or more keyboards 506.

In some embodiments, the voice broadcast device includes a memory and one or more programs, wherein the one or more programs are stored in the memory and may include one or more modules, each module may include a series of computer-executable instructions in the voice broadcast device, and the one or more programs are configured to be executed by one or more processors and include computer-executable instructions for: acquiring at least one piece of voice information and delivering the at least one piece of voice information to a target voice broadcast device, wherein the at least one piece of voice information includes voice identification information and voice content; determining a first voice content to be broadcast by the target voice broadcast device; and sending the first voice identification information corresponding to the first voice content to the target voice broadcast device, wherein the first voice identification information is used by the target voice broadcast device to obtain the first voice content, and to broadcast the first voice content when a preset voice broadcast condition is met.

In some embodiments, when the computer-executable instructions are executed, the processor may also be caused to determine the first voice content to be broadcast by the target voice broadcast device by: determining, according to a preset voice broadcast rule, whether the target voice broadcast device needs to broadcast the voice content; and determining, if the target voice broadcast device needs to broadcast the voice content, the first voice content to be broadcast by the target voice broadcast device.

In some embodiments, when the computer-executable instructions are executed, said processor may also be caused to: determine, when the broadcast information corresponding to the target voice broadcast device meets a preset broadcast condition, that the target voice broadcast device needs to broadcast the voice content, wherein the broadcast information includes at least one of the total number of broadcasts in the first designated time segment, the first duration since the last broadcast of the voice content, and the number of consecutive broadcasts of other voice contents in the second designated time segment.

In some embodiments, the preset broadcast conditions include at least one of the following: the total number of broadcasts is less than a first preset threshold; the first duration reaches a preset duration; the number of consecutive broadcasts of other voice contents in the second designated time segment reaches a second preset threshold.

In some embodiments, when the computer-executable instructions are executed, said processor may also be caused to perform at least one of the following: determine the first voice content according to a preset carousel broadcasting rule, wherein the carousel broadcasting rule is used to specify a carousel broadcasting order of the voice contents in the at least one piece of voice information; determine that the designated voice content is the first voice content; determine the first voice content according to personal information and/or preference information of a device user.

In some embodiments, when the computer-executable instructions are executed, said processor may also be caused to: acquire related information of the target voice broadcast device, wherein the related information includes at least one of the following: user information of a device user, behavior information of the device user, preference information of the device user, and device location information; acquire the at least one piece of voice information corresponding to said target voice broadcast device according to the related information.

In some embodiments, when the computer-executable instructions are executed, said processor may also be caused to: determine, if there is currently a task of delivering voice information to other voice broadcast devices, priorities respectively corresponding to the target voice broadcast device and the other voice broadcast devices; determine the delivery time of the at least one piece of voice information according to the priorities respectively corresponding to the target voice broadcast device and the other voice broadcast devices, and deliver the at least one piece of voice information to the target voice broadcast device according to the delivery time, wherein the delivery time includes the time before the voice information is delivered to the other voice broadcast devices, or the time after the voice information is delivered to the other voice broadcast devices.

In some embodiments, when the computer-executable instructions are executed, said processor may also be caused to: obtain, when the at least one piece of voice information is updated, the updated at least one piece of voice information, wherein the update to the voice information includes at least one action of adding, deleting, and modifying the voice information; deliver the updated voice information to the target voice broadcast device.
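One way a receiving device might apply such an update to its local store is sketched below. The specification only states that updated voice information is obtained and delivered; the `apply_update` function, the action names, and the dictionary-based store are assumptions introduced for illustration.

```python
def apply_update(store, action, voice_id, content=None):
    """Apply one add, modify, or delete action to a local voice store.

    store maps voice identification information to voice content.
    The action names are hypothetical labels for the three update types.
    """
    if action in ("add", "modify"):
        if content is None:
            raise ValueError("add and modify require voice content")
        store[voice_id] = content
    elif action == "delete":
        # Deleting an identifier that is not stored is treated as a no-op.
        store.pop(voice_id, None)
    else:
        raise ValueError(f"unknown action: {action}")
    return store


local_store = {"v1": "old content"}
apply_update(local_store, "modify", "v1", "new content")
apply_update(local_store, "add", "v2", "another content")
apply_update(local_store, "delete", "v1")
```

Because the update touches only stored data, no change to the device hardware or firmware is implied by this sketch.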

One or more embodiments of this specification also provide a voice broadcast device, as shown in FIG. 6. The voice broadcast device may vary considerably depending on its configuration or performance and may include one or more processors 601 and a memory 602. The memory 602 may store one or more applications or data. The memory 602 may be a temporary storage or a persistent storage. The application program stored in the memory 602 may include one or more modules (not shown in the figure), and each module may include a series of computer-executable instructions in a voice broadcast device. Furthermore, the processor 601 may be configured to communicate with the memory 602 and execute a series of computer-executable instructions in the memory 602 on the voice broadcast device. The voice broadcast device may further include one or more power sources 603, one or more wired or wireless network interfaces 604, one or more input/output interfaces 605, and one or more keyboards 606.

In some embodiments, the voice broadcast device includes a memory and one or more programs, wherein the one or more programs are stored in the memory and may include one or more modules, each module may include a series of computer-executable instructions in the voice broadcast device, and the one or more programs are configured to be executed by one or more processors and include computer-executable instructions for: acquiring at least one piece of voice information and delivering said at least one piece of voice information to a transaction voice broadcast device, wherein the voice information includes voice identification information and voice content; determining a transaction voice content to be broadcast by the transaction voice broadcast device; and sending the transaction voice identification information corresponding to the transaction voice content to the transaction voice broadcast device, wherein the transaction voice identification information is used by the transaction voice broadcast device to obtain the transaction voice content, and to broadcast the transaction voice content when a preset voice broadcast condition is met.

In some embodiments, when the computer-executable instructions are executed, the processor may also be caused to: determine, according to a preset voice broadcast rule, whether the transaction voice broadcast device needs to broadcast said voice content; determine, if the transaction voice broadcast device needs to broadcast said voice content, a transaction voice content to be broadcast by the transaction voice broadcast device. In some embodiments, when the computer-executable instructions are executed, the processor may also be caused to: determine the transaction voice content to be broadcast by the transaction voice broadcast device according to the number of transactions and/or the transaction amount of the device user.

In some embodiments, when the computer-executable instructions are executed, the processor may also be caused to: acquire a preset mapping relationship between a range of the number of transactions and/or a range of transaction amounts and the voice content; determine a range of a first number of transactions within which the number of transactions falls and/or determine a first transaction amount range within which the transaction amount falls; determine, according to the range of the first number of transactions, the first transaction amount range, and the mapping relationship, the transaction voice content to be broadcast by the transaction voice broadcast device.

One or more embodiments of this specification also provide a computer-readable storage medium that stores one or more programs, and the one or more programs include instructions. When the instructions are executed by an electronic device including multiple application programs, the electronic device may be caused to execute the foregoing voice broadcast method and is specifically configured to execute the following: acquiring at least one piece of voice information; delivering the at least one piece of voice information to a target voice broadcast device, wherein the at least one piece of voice information includes voice identification information and voice content; determining a first voice content to be broadcast by the target voice broadcast device; sending the first voice identification information corresponding to the first voice content to the target voice broadcast device, wherein the first voice identification information is used by the target voice broadcast device to obtain the first voice content, and to broadcast the first voice content when a preset voice broadcast condition is met.

One or more embodiments of this specification also provide a computer-readable storage medium that stores one or more programs, and the one or more programs include instructions. When the instructions are executed by an electronic device including multiple application programs, the electronic device may be caused to execute the foregoing voice broadcast method and is specifically configured to execute the following: acquiring at least one piece of voice information; delivering the at least one piece of voice information to a transaction voice broadcast device, wherein the voice information includes voice identification information and voice content; determining a transaction voice content to be broadcast by the transaction voice broadcast device; sending the transaction voice identification information corresponding to the transaction voice content to the transaction voice broadcast device, wherein the transaction voice identification information is used by the transaction voice broadcast device to obtain the transaction voice content, and to broadcast the transaction voice content when a preset voice broadcast condition is met.

The system, device, module, or unit described in the foregoing embodiments may be specifically implemented by a computer chip or entity, or a product with a certain function. A typical implementation device is a computer. Specifically, the computer may be, for example, a personal computer, a laptop computer, a cellular phone, a camera phone, a smart phone, a personal digital assistant, a media player, a navigation device, an email device, a game console, a tablet computer, a wearable device, or a combination of any of these devices.

For the convenience of description, when the above device is described, the functions are divided into various units and described separately. Certainly, when one or more embodiments of this specification are implemented, the functions of each unit may be implemented in the same piece or in multiple pieces of software and/or hardware.

Those skilled in the art should understand that one or more embodiments of this specification may be provided as a method, a system, or a computer program product. Therefore, one or more embodiments of this specification may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Moreover, one or more embodiments of this specification may adopt a form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer-usable program codes therein.

One or more embodiments of this specification are described with reference to flowcharts and/or block diagrams of methods, devices (systems), and computer program products according to embodiments of the present specification. Each process and/or block in the flowcharts and/or block diagrams, and combinations of processes and/or blocks in the flowcharts and/or block diagrams may be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general-purpose computer, special-purpose computer, embedded processor, or other programmable data processing device to produce a machine, so that the instructions executed by the processor of the computer or other programmable data processing device produce a device for implementing the functions specified in one or more flows in the flowcharts and/or one or more blocks in the block diagrams.

These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing devices to work in a specific manner such that the instructions stored in the computer-readable memory produce a manufactured article including an instruction apparatus. The instruction apparatus implements the functions specified in one or more flows in the flowcharts and/or one or more blocks in the block diagrams.

These computer program instructions may also be loaded onto a computer or other programmable data processing devices, so that a series of steps may be performed on the computer or other programmable devices to produce a computer-implemented process, which may be executed on the computer or other programmable devices. The instructions provide steps for implementing functions specified in one or more flows in the flowcharts and/or one or more blocks in the block diagrams.

In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.

A memory may include non-persistent memory, random access memory (RAM), and/or non-volatile memory in computer-readable media, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.

Computer-readable media include permanent and non-permanent, removable and non-removable media. Information storage may be implemented by any method or technology. Information may be computer-readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, compact disc read-only memory (CD-ROM), digital versatile disc (DVD) or other optical storage, magnetic tape cartridges, magnetic tape storage or other magnetic storage devices, or any other non-transmitting medium that may be used to store information accessible by a computing device. As defined herein, computer-readable media do not include transitory computer-readable media (transitory media), such as modulated data signals and carrier waves.

The terms “including”, “comprising” or any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, product or device that includes a series of elements includes not only those elements but also other elements not explicitly listed, or those that are inherent to such a process, method, product, or device. Without more restrictions, the elements defined by the sentence “including a . . .” do not exclude the existence of other identical elements in the process, method, product or device including the elements.

One or more embodiments of this specification may be described in the general context of computer-executable instructions executed by a computer, such as program modules. Generally, program modules include routines, programs, objects, components, and data structures that perform specific tasks or implement specific abstract data types. The present specification may also be practiced in distributed computing environments in which tasks are performed by remote processing devices connected through a communications network. In a distributed computing environment, program modules may be in local and remote computer storage media, including storage devices.

Each embodiment in this specification is described in a progressive manner, and the same or similar parts of the various embodiments may be cross-referenced. Each embodiment focuses on the differences from other embodiments. In particular, since the system embodiment is basically similar to the method embodiment, its description is relatively brief. For the relevant parts, refer to the description of the method embodiment.

The foregoing descriptions are merely one or more embodiments of this specification but are not intended to limit the protection scope of this specification. For those skilled in the art, various modifications and changes may be made to one or more embodiments of this specification. Any modification, equivalent replacement, and improvement made within the spirit and principle of one or more embodiments of this specification shall be included in the scope of claims of one or more embodiments of this specification.

What is claimed is:
 1. A voice broadcast method, comprising: acquiring at least one piece of voice information; delivering the at least one piece of voice information to a voice broadcast device, wherein, the at least one piece of voice information includes voice identification information and a voice content; determining a first voice content to be broadcast by the voice broadcast device, wherein the first voice content is included in the at least one piece of voice information stored at the voice broadcast device; and sending, to the voice broadcast device, a first voice identification information corresponding to the first voice content, wherein the first voice identification information is used by the voice broadcast device to obtain the first voice content stored at the voice broadcast device, and to broadcast the first voice content when a preset voice broadcast condition is met.
 2. The method according to claim 1, wherein the determining a first voice content to be broadcast by the voice broadcast device comprises: determining, according to a preset voice broadcast rule, whether the voice broadcast device needs to broadcast a voice content; and determining, in response to determining that the voice broadcast device needs to broadcast a voice content, that the first voice content is to be broadcast by the voice broadcast device.
 3. The method according to claim 2, wherein the determining, according to a preset voice broadcast rule, whether the voice broadcast device needs to broadcast a voice content comprises: determining, that the voice broadcast device needs to broadcast the first voice content, in response to: a total number of broadcasts in a first designated time segment is less than a first preset threshold, a first duration from the last broadcast of a voice content reaches a preset duration, or a number of consecutive broadcasts of voice contents in a second designated time segment reaches a second preset threshold.
 4. The method according to claim 1, wherein the determining a first voice content to be broadcast by the voice broadcast device comprises one of the following: determining the first voice content as a designated voice content according to a preset carousel broadcasting rule, wherein the carousel broadcasting rule is configured to specify a broadcasting order of voice contents in the at least one piece of voice information; and determining the first voice content according to personal information or preference information of a user associated with the voice broadcast device.
 5. The method according to claim 1, wherein the acquiring at least one piece of voice information comprises: acquiring related information of the voice broadcast device, wherein the related information includes at least one of the following: user information of a user associated with the voice broadcast device, behavior information of the user, preference information of the user, and location information of the voice broadcast device; and acquiring, according to the related information, the at least one piece of voice information corresponding to the voice broadcast device.
 6. The method according to claim 1, wherein the delivering the at least one piece of voice information to a voice broadcast device comprises: determining, if there is a task of delivering voice information to other voice broadcast devices, priorities respectively corresponding to the voice broadcast device and other voice broadcast devices; and determining, according to the priorities respectively corresponding to the voice broadcast device and the other voice broadcast devices, a delivery time of the at least one piece of voice information, and delivering, according to the delivery time, the at least one piece of voice information to the voice broadcast device.
 7. The method according to claim 1, further comprising: obtaining, when an update is performed on the at least one piece of voice information, the updated at least one piece of voice information, wherein the update includes at least one of adding, deleting, or modifying the at least one piece of voice information; and delivering the updated at least one piece of voice information to the voice broadcast device.
 8. The method according to claim 1, wherein the first voice content includes a transaction voice content, and the determining a first voice content to be broadcast by the voice broadcast device comprises: determining, according to a number of transactions or a transaction amount of a user associated with the voice broadcast device, the transaction voice content to be broadcast by the voice broadcast device.
 9. The method according to claim 8, wherein the determining, according to a number of transactions or a transaction amount of the user, the transaction voice content to be broadcast by the voice broadcast device comprises: acquiring a preset mapping relationship between a range of the number of transactions or a range of transaction amounts and the voice content; determining a range of a first number of transactions, wherein the number of transactions falls within the range of the first number of transactions, or determining a first transaction amount range, wherein the transaction amount falls within the first transaction amount range; and determining, according to the range of the first number of transactions, the first transaction amount range, and the mapping relationship, the transaction voice content to be broadcast by the voice broadcast device.
 10. A non-transitory computer-readable storage medium configured with instructions executable by one or more processors to cause the one or more processors to perform operations comprising: acquiring at least one piece of voice information; delivering the at least one piece of voice information to a voice broadcast device, wherein, the at least one piece of voice information includes voice identification information and a voice content; determining a first voice content to be broadcast by the voice broadcast device, wherein the first voice content is included in the at least one piece of voice information stored at the voice broadcast device; and sending, to the voice broadcast device, a first voice identification information corresponding to the first voice content, wherein the first voice identification information is used by the voice broadcast device to obtain the first voice content stored at the voice broadcast device, and to broadcast the first voice content when a preset voice broadcast condition is met.
 11. The non-transitory computer-readable storage medium according to claim 10, wherein the determining a first voice content to be broadcast by the voice broadcast device comprises: determining, according to a preset voice broadcast rule, whether the voice broadcast device needs to broadcast a voice content; and determining, in response to determining that the voice broadcast device needs to broadcast a voice content, that the first voice content is to be broadcast by the voice broadcast device.
 12. A voice broadcast device, comprising: one or more processors; and a memory storing instructions executable by the one or more processors to cause the one or more processors to perform operations comprising: acquiring at least one piece of voice information; delivering the at least one piece of voice information to a voice broadcast device, wherein, the at least one piece of voice information includes voice identification information and a voice content; determining a first voice content to be broadcast by the voice broadcast device, wherein the first voice content is included in the at least one piece of voice information stored at the voice broadcast device; and sending, to the voice broadcast device, a first voice identification information corresponding to the first voice content, wherein the first voice identification information is used by the voice broadcast device to obtain the first voice content stored at the voice broadcast device, and to broadcast the first voice content when a preset voice broadcast condition is met.
 13. The device according to claim 12, wherein the determining a first voice content to be broadcast by the voice broadcast device comprises: determining, according to a preset voice broadcast rule, whether the voice broadcast device needs to broadcast a voice content; and determining, in response to determining that the voice broadcast device needs to broadcast a voice content, that the first voice content is to be broadcast by the voice broadcast device.
 14. The device according to claim 13, wherein the determining, according to a preset voice broadcast rule, whether the voice broadcast device needs to broadcast the voice content comprises: determining, that the voice broadcast device needs to broadcast the first voice content, in response to: a total number of broadcasts in a first designated time segment is less than a first preset threshold, a first duration from the last broadcast of a voice content reaches a preset duration, or a number of consecutive broadcasts of voice contents in a second designated time segment reaches a second preset threshold.
 15. The device according to claim 12, wherein the determining a first voice content to be broadcast by the voice broadcast device comprises one of the following: determining the first voice content as a designated voice content according to a preset carousel broadcasting rule, wherein the carousel broadcasting rule is configured to specify a broadcasting order of voice contents in the at least one piece of voice information; and determining the first voice content according to personal information or preference information of a user associated with the voice broadcast device.
 16. The device according to claim 12, wherein the acquiring at least one piece of voice information comprises: acquiring related information of the voice broadcast device, wherein the related information includes at least one of the following: user information of a user associated with the voice broadcast device, behavior information of the user, preference information of the user, and location information of the voice broadcast device; and acquiring, according to the related information, the at least one piece of voice information corresponding to the voice broadcast device.
 17. The device according to claim 12, wherein the delivering the at least one piece of voice information to a voice broadcast device comprises: determining, if there is a task of delivering voice information to other voice broadcast devices, priorities respectively corresponding to the voice broadcast device and other voice broadcast devices; determining, according to the priorities respectively corresponding to the voice broadcast device and the other voice broadcast devices, a delivery time of the at least one piece of voice information, and delivering, according to the delivery time, the at least one piece of voice information to the voice broadcast device.
 18. The device according to claim 12, wherein the operations further comprise: obtaining, when an update is performed on the at least one piece of voice information, the updated at least one piece of voice information, wherein the update includes at least one of adding, deleting, or modifying the at least one piece of voice information; and delivering the updated at least one piece of voice information to the voice broadcast device.
 19. The device according to claim 12, wherein the first voice content includes a transaction voice content, and the determining a first voice content to be broadcast by the voice broadcast device comprises: determining, according to a number of transactions or a transaction amount of a user associated with the voice broadcast device, the transaction voice content to be broadcast by the voice broadcast device.
 20. The device according to claim 19, wherein the determining, according to a number of transactions or a transaction amount of the user, the transaction voice content to be broadcast by the voice broadcast device comprises: acquiring a preset mapping relationship between a range of the number of transactions or a range of transaction amounts and the voice content; determining a range of a first number of transactions, wherein the number of transactions falls within the range of the first number of transactions, or determining a first transaction amount range, wherein the transaction amount falls within the first transaction amount range; and determining, according to the range of the first number of transactions, the first transaction amount range, and the mapping relationship, the transaction voice content to be broadcast by the voice broadcast device.